An Early Warning System for Vehicle Related Quality Data

نویسندگان

  • Matthias Grabert
  • Markus Prechtel
  • Tomas Hrycej
  • Winfried Günther
چکیده

Vehicle production audit tests, warranty claims and car control unit data are stored in a central data warehouse for data mining analysis. Neural network based part failure rate estimations, adjusted for mileage and seasonality, are used for monitoring warranty claims. Association and sequence analysis connect production audit data, car control unit data and warranty claims for an early detection of quality changes both in production state and car field usage. Calculations are performed via grid computing. 1 Data Driven Quality Management A major aspect of quality management is avoidance of quality defects. Nevertheless, 100% freedom from error in motor vehicle production and during their lifecycles cannot be achieved with these highly complex products. Therefore, a continuous quality monitoring at production time and during the lifecycle is mandatory to early address upcoming quality problems. Vehicle service centers document repair cases with standardized codes, and all reports are later on transferred to the central claim database at least during the warranty period. Beside repair cases, protocols of car control units are read out during service center visits. Newest generation cars collect load spectrum data continuously and reports are later transferred at WLAN hotspots or via GSM to the carrier. Data sources are of different kinds: While failure codes during warranty periods are completely documented and therefore available for all cars, only approximately 1% of manufactured motor vehicles are checked carefully directly before delivery. These production audits happen definitely earlier and allow thus a rapid identification and elimination of quality problems. Therefore, an early as well as detailed quality monitoring system optimally consists of different data sources in different lifecycle stages. An ultimate goal of such a quality monitoring system is beside a rapid detection of already existing problems – to give a reliable forecast of upcoming events or increasing failure numbers as early as possible. “As early as possible” has to be seen in the context of structural conditions of each car component. Upcoming failures are mainly dependant on car age, mileage and – e.g. in commercial vehicles – usage profile. An Early Warning System for Vehicle Related Quality Data 89 The following data sources have been available at DaimlerChrysler for our early warning system: • repair codes during warranty period (called in this paper “DB1”) • messages from control unit codes (called in this paper “DB2”) • results of production audits before delivery (called in this paper “DB3”) An overview of the software system as well as deployed data mining methods will be given in the rest of the article. 2 Models and Methods Overview The early warning system is working on a two-steps base (Fig.1): First, the system is trained with data from the past (usually 24 months): complex mathematical distribution models are learned for warranty claims (DB1) taking into account seasonal changes of failure rate (e.g. air-condition fails usually during summer due to heavy usage). Models are adjusted for mileage and age of the vehicles. For vehicles without warranty entries, mileage has to be estimated since no information is then available in DB1. Furthermore, associations between warranty claims and control unit messages (DB2) as well as production audit results (DB3) are calculated in step one. In a second step, current DB1 data (usually last 12 months) are tested on a monthly base for significant changes against the calculated models of step one (Fig. 2). Grid computing is used for DB1 training and testing (~ 10,000 models are processed during testing only). Furthermore, the probability of a warranty case under the condition of a pre-occurring DB3 (audit result) or DB2 (control unit message) Fig. 1. Model related system architecture 90 M. Grabert et al. event are calculated to quantify the association strengths. Significant associations are determined by a Chi-Square-Test. Warnings for the system user are then generated and displayed. Every warning is a candidate for a possible production weakness and has to be carefully evaluated for necessary consequences by a human expert. 2.1 Software Architecture and Implementation Details Figure 2 displays the EWS software architecture. User and system administrator navigate through the password protected system via standard web browser within the DaimlerChrysler intranet. The system administrator configures the system and starts training or testing steps, when needed. Data is loaded via PERL::DBI from different data sources into a MySQL [1] based data warehouse. Motor vehicles with unreliable date of initial registration or repair date (e.g. before production date) are excluded from later analysis (Data Cleansing) [2]). Fig. 2. Overview on system software architecture Software components (Perl, Unix-tools, MySQL database, gnuplot to generate graphics as well as C programs for mathematical modeling) have been chosen regarding the ease of portability. The system is currently running on a twoprocessor HP A500 with 440MHz, 2 GB RAM under HP-UX 11.00. Time for porting to a different UNIX system (such as Solaris, Linux) is below one day. An Early Warning System for Vehicle Related Quality Data 91 2.2 Car Warranty Claims Mercedes-Benz’s warranty database (DB1) contains the complete repair history of every Mercedes vehicle during warranty time (at least first 12 months after initial registration, another 12 months depending on mileage). Currently data of 17 Mio. cars are collected in the database. Part failures are standardized documented during car repair using over 15,000 so called “damage codes”. The following part of the early warning system solely depends on DB1 data analysis using the above mentioned two-step architecture: 1. Calculate a mathematical model for the damage frequency of each damage code appearing in a specified model series for a given training period (typically 24 months of production). 2. Test the damage data of newly produced cars (e.g. last year of production) for significant distribution changes against the trained models (test period has to be different from training period!). Tests are performed on a car group level. Vehicles in the group should be as similar as possible to guarantee a homogenous random sample for the stochastic modeling. Grouping criteria are: • model series (e.g. E-Class, Sprinter, Actros) • segments within the model series (e.g. Sprinter 2,8t or Sprinter with right hand drive) • engine (power rating) • model type (determined by the first 6 digits of the unique vehicle identification number) Car groups can be defined in database tables by the system administrator, thus enabling the system to be easily adapted to new model series. A distribution model is only built for a car group if enough failure cases are in the specified group for a specified damage code. The level of specialization is ascending for the describing criteria: After a calculation on model series level (lowest level of specialization), a calculation on segment level will be tried and so on. Specialization is aborted if no stochastic model can be calculated for a specific level (too few data). Each stochastic model is finally evaluated by testing the model with the training data: If the trained model is not good enough to pass the test with data used for the learning phase, it will be discarded. Atypical production months (e.g. after production start of a new series) can be excluded from the training period on damage code level. For the Sprinter model series, about 550 stochastic models are typically generated for about 90,000 cars and different damage codes during training. Training phase including data transfer from the central warranty database to the data warehouse (~100MB) takes about 6 days on a single machine. For a final model calculation, up to 600MB of prepared data have to be processed for each car group (estimation of missing mileage or date of initial registration, part failure rate for each month after registration). The parameters of the generated distribution model include no more than 500 bytes. These long training periods could be reduced by 50% with the help of grid computing since calculation of each model is independent from another. 92 M. Grabert et al. During test phase, typically the last 12 months of production will be tested against the trained models of the past. For each production month all available months after registration will be tested separately for a specific car group. A warning is produced when the number of failures for a car group exceeds the model based significance level (figure 3) in a specific damage code. A 12-months test for the above mentioned Sprinter group takes about 1.5 days of computation time. Fig. 3. The x-axis corresponds to months 1 ... 12 after initial registration. Car group, production month and damage code are fixed. The y-axis shows the cumulated failure probabilities (forecasted failure rate, current failure rate and calculated significance boundary). The current failure rate exceeds in two months the significance boundary (month 6 and 7). Thus, two warnings will be generated by the system for the specific car group and damage code 2.3 Production Quality Audit Beside all quality monitoring performed during car manufacturing, approximately one percent of all cars are carefully audited directly before leaving the factory. Found quality issues are encoded according to the warranty damage codes and then entered into a database (DB3). The early warning system for DB3 data should recognize significant changes in production quality and reveal a possible correlation between production failures and later on warranty cases (DB1). An approach similar to the DB1-system is used to monitor production quality: Audit data are separated into training and testing set. Since neither time nor mileage is relevant for any failures directly after manufacturing, the failure probability can simply be estimated using the relative frequency of failures within the training period (usually several months of production). Warnings during the test phase are generated, if the observed number of failures exceeds the expected number of failures. To identify only significant changes in the failure rate, a hypothesis test based on the binomial distribution is applied. 0 ,00 0 0 ,00 5 0 ,01 0 0 ,01 5 0 ,02 0 1 2 3 4 5 6 7 8 9 10 11 12 fo re c a s t fa ilu re ra te s ig n ific a n c e b o u n d a ry c u rre n t fa ilu re ra te Time since intial car registration [months] F ai lu re p ro ba bi lit y fo r a sp ec ifi c pa rt warnings An Early Warning System for Vehicle Related Quality Data 93 The probability for a warranty failure under the condition of a preceding production failure is calculated to identify a possible correlation between both failure types. Significance of correlation is assured by a chi-square test [3]. 2.4 Car Control Unit Messages Modern motor vehicles like the Mercedes S-Class contain up to 60 control units to control important car functions such as the Anti-lock Brake System (ABS), Electronic Stability Program (ESP) and fuel economy. Beside the control task, these units monitor all connected devices (e.g. sensors) for correct functionality. Errors are internally stored and offer valuable diagnostic data at each garage visit. Since 1999, control unit messages are collected in a special central database (DB2) at DaimlerChrysler to early detect upcoming failures in specific morel series. The early warning system for DB2 is built according to the second part of the production audit early warning system: The probability for a warranty failure under the condition of a preceding control unit error message is calculated and significant correlations are identified by a chi-square test. These correlations are valuable forecasts for potentially upcoming part failures and thus a further garage visit can be avoided for the client. Unfortunately, not all Mercedes garages are connected to the DB2 databases. Therefore, not all control unit messages can be collected in the database which makes it impossible to identify changes in the error message distribution. 3 Details of Stochastic Models Failure processes are stochastic processes and thus their appropriate representation is done with help of stochastic models. The focus of interest is whether the frequency of newly occurred failures has increased in comparison with a reference time interval. Theoretically, comparisons with fixed reference values can be done, too. In practice, this fails because of the number of different failure types (about 15000), for which such reference values would have to be specified for each model series. Furthermore, the variability of the vehicle mission (described for example by the vehicle age, monthly mileage rate, or season of the year) would have to be taken into account. So a certain failure rate can be normal with a monthly rate of 20,000 kilometers while it is too high with a monthly rate of 2,000 kilometers. This makes the comparison with a reference time interval the only viable alternative. In statistics, concepts for quantification of statements about the differences between two data sets have been elaborated. This quantification takes place in terms of probabilities. A typical instrument, frequently used in quality control, is the hypothesis test. In our case, it is hypothesized that the quality has not changed. In the statistical language, this corresponds to the assumption that the frequency of the newly occurred failures can be explained by the failure probability distribution observed in the reference interval. To be able to do this, the reference probability distribution has to be represented by a model received directly from the data. For this, various distribution classes have been proposed (Weibull distribution, Gamma distribution, Fatigue Life distribution 94 M. Grabert et al. [4]). Unfortunately, none of these standard distributions is flexible enough to capture the abundance of the really observed failure distributions, two examples of which are presented in Fig. 4. 0 0,01 0,02 0,03 0,04 0,05 0,06 0,07 0,08 0,09 0,1 1 2 3 4 5 6 7 8 9 10 11 12 age in months 0 0,02 0,04 0,06 0,08 0,1 0,12 0,14 0,16 0,18 1 2 3 4 5 6 7 8 9 10 11 12 age in months H a z a r d

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Development of Early Warning System for Desertification and Land Degradation

Early warning systems are key components of strategies to reduce risk. This research, by adopting a systematic approach in the management of the risk of desertification and by including previously developed models and systems, offers an integrated efficient structure in terms of early warning for the risk of desertification as a pilot system for semi-arid areas of west Golestan Province in IRAN...

متن کامل

Cooperative Intersection Collision - Warning System Based on Vehicle-to-Vehicle Communication

Intersection-collision warning systems use vehicle-to-infrastructure communication to avoid accidents at urban intersections. However, they are costly because additional roadside infrastructure must be installed, and they suffer from problems related to real-time information delivery. In this paper, an intersection-collision warning system based on vehicle-to-vehicle communication is proposed i...

متن کامل

Has Tehran Stock Market Calmed Down after Global Financial Crisis?Markov Switching GARCH Approach

We have introduced an early warning system for volatility regimes regarding Tehran Stock Exchange using Markov Switching GARCH approach. We have examined whether Tehran Stock Market has calmed down or more specifically, whether the surge in volatility during 2007-2010 global financial crises still affects stock return volatility in Iran.  Doing so, we have used a regime switching GARCH model.  ...

متن کامل

Towards Developing an Early Warning System for Cucumber Diseases for Greenhouse in China

The integrated management of cucumber (Cucumis sativus L.) diseases play a key role in guaranteeing the high quality and security of cucumber production in greenhouse, moreover, the early warning of cucumber diseases is the chief precondition for IPM (Integrated Pest Management). This paper describes an attempt to develop an early warning system for cucumber diseases in greenhouse. By analysing...

متن کامل

Evaluating the Application of a Financial Early Warning System in the Iranian Banking System

One of the significant problems of banks and investors in Iran is the lack of precise awareness about the financial performance of each bank and the roadmap for improving the conditions. Besides, the undesirable status of the financial performance of banks becomes evident only when the improvement of conditions is complicated. In this paper, a data mining-based early warning system (EWS) model ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004